Background: The increasing adoption of electronic health records (EHRs) in the United States has created troves of computable data, to which machine learning methods have been applied to extract useful insights. EHR data, represented as a three-dimensional analogue of a matrix (a tensor), can be decomposed into two-dimensional factors that are interpretable as computational phenotypes. Methods: We applied constrained tensor factorization to derive computational phenotypes and predict mortality in cohorts of patients with breast, prostate, colorectal, or lung cancer in the Northwestern Medicine Enterprise Data Warehouse through 2015. In our experiments, we examined the use of a supervised term in the factorization algorithm, filtering the tensor by medical indications, and incorporating additional social determinants of health (SDOH) covariates into the factorization process. We qualitatively assessed the resulting computational phenotypes and evaluated their ability to predict five-year mortality using the area under the curve (AUC) statistic. Results: Medical indication filtering led to more concise and interpretable phenotypes. Mortality prediction performance (AUC) varied across experimental conditions and cancer types (breast: 0.623-0.694; prostate: 0.603-0.750; colorectal: 0.523-0.641; lung: 0.517-0.623). In general, the use of the supervised term and the incorporation of SDOH covariates improved predictive performance. Conclusions: Constrained tensor factorization, applied to sparse EHR data of patients with cancer, can discover computational phenotypes predictive of five-year mortality. Incorporating SDOH variables into the factorization algorithm is an easy-to-implement and effective way to improve prediction performance.
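To make the pipeline concrete, here is a minimal sketch of factorization-then-prediction using an off-the-shelf non-negative CP decomposition (via tensorly) as a stand-in for the paper's constrained factorization. The tensor shape, rank, SDOH features, and labels are all synthetic placeholders, and the sketch appends SDOH covariates to the learned patient loadings rather than incorporating them inside the factorization as the authors describe.

```python
import numpy as np
import tensorly as tl
from tensorly.decomposition import non_negative_parafac
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Hypothetical patients x diagnoses x medications count tensor (synthetic).
rng = np.random.default_rng(0)
X = tl.tensor(rng.poisson(0.1, size=(500, 40, 30)).astype(float))

# Rank-R non-negative CP decomposition; each rank-one component is
# interpretable as a phenotype (a diagnosis profile plus a medication profile).
weights, factors = non_negative_parafac(X, rank=8, n_iter_max=200)
patient_loadings = factors[0]             # (500, 8) per-patient phenotype loadings

# Illustrative SDOH covariates appended post hoc (the paper incorporates
# them inside the factorization instead).
sdoh = rng.normal(size=(500, 3))          # e.g., income, education, insurance
features = np.hstack([patient_loadings, sdoh])
y = rng.integers(0, 2, size=500)          # five-year mortality labels (synthetic)

X_tr, X_te, y_tr, y_te = train_test_split(features, y, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("AUC:", roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1]))
```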
Systemic lupus erythematosus (SLE) is a rare autoimmune disease characterized by an unpredictable course of flares and remissions with diverse manifestations. Lupus nephritis, one of the major disease manifestations of SLE for organ damage and mortality, is a key component of lupus classification criteria. Accurately identifying lupus nephritis in electronic health records (EHRs) would therefore benefit large cohort observational studies and clinical trials, where characterization of the patient population is critical for recruitment, study design, and analysis. Lupus nephritis can be recognized through procedure codes and structured data, such as laboratory tests. However, other critical information documenting lupus nephritis, such as histologic reports from kidney biopsies and prior medical history narratives, requires sophisticated text processing to mine the information from pathology reports and clinical notes. In this study, we developed algorithms to identify lupus nephritis from EHR data with and without the use of natural language processing (NLP). We developed four algorithms: a rule-based algorithm using only structured data (the baseline algorithm) and three algorithms using different NLP models. The three NLP models are based on regularized logistic regression and use different sets of features, including positive mentions of concept unique identifiers (CUIs), counts of CUI appearances, and a mixture of three components. The baseline algorithm and the best-performing NLP algorithm were externally validated on a dataset from Vanderbilt University Medical Center (VUMC). Our best-performing NLP model, which incorporates features from structured data, regular expression concepts, and mapped CUIs, improved performance over the baseline lupus nephritis algorithm on both the NMEDW (0.41 vs 0.79) and VUMC (0.62 vs 0.96) datasets.
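As a rough illustration of the NLP-based classifiers, the sketch below trains an L2-regularized logistic regression over bag-of-CUI counts plus a couple of structured-data flags. The concept extraction step is elided, and the CUIs, features, and labels shown are invented for the example.

```python
import numpy as np
from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression

# Illustrative per-patient feature dictionaries: counts of CUI mentions
# mined from notes plus structured-data flags. All values are made up.
patients = [
    {"C0027697": 3, "C0034186": 1, "biopsy_code": 1, "proteinuria_lab": 1},
    {"C0014061": 2, "biopsy_code": 0, "proteinuria_lab": 0},
]
labels = np.array([1, 0])  # lupus nephritis yes/no (synthetic)

vec = DictVectorizer()
X = vec.fit_transform(patients)

# Regularized logistic regression over the combined feature set;
# C is the inverse regularization strength.
clf = LogisticRegression(penalty="l2", C=1.0, max_iter=1000)
clf.fit(X, labels)
print(clf.predict_proba(X)[:, 1])
```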
Laboratory testing and medication prescribing are two of the most important routines in daily clinical practice. Developing an artificial intelligence system that can automatically make laboratory test imputations and medication recommendations could save potentially redundant lab tests and inform physicians of more effective prescriptions. We present an intelligent medical system (named MedGCN) that can automatically recommend medications for patients based on their incomplete lab tests, and can even accurately estimate lab values that have not been measured. In our system, we integrate the complex relations among multiple types of medical entities, together with their inherent features, in a heterogeneous graph. We then model the graph to learn a distributed representation for each entity in the graph based on graph convolutional networks (GCN). Through the propagation of the graph convolutional network, an entity representation can incorporate multiple types of medical information and thereby benefit multiple medical tasks. Moreover, we introduce a cross-regularization strategy to reduce overfitting in multi-task training via the interaction between tasks. In this study, we construct a graph to associate four types of medical entities, i.e., patients, encounters, lab tests, and medications, and apply a graph neural network to learn node embeddings for medication recommendation and lab test imputation. We validate our MedGCN model on two real-world datasets: NMEDW and MIMIC-III. Experimental results on both datasets demonstrate that our model achieves state-of-the-art performance in both tasks. We believe our innovative system can provide a promising and reliable way to help physicians make medication prescriptions and save potentially redundant lab tests.
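A heavily simplified PyTorch sketch of the core idea follows: all entity nodes share one (normalized) adjacency matrix, two GCN layers propagate information across entity types, and two task heads are trained jointly. Dimensions, the propagation rule, and the joint loss are assumptions; the actual MedGCN also includes the cross-regularization term, which is not shown.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyMedGCN(nn.Module):
    """Two GCN layers over a joint graph of patients/encounters/labs/meds,
    followed by two task-specific heads (multi-task learning)."""
    def __init__(self, n_nodes, hidden, n_meds, n_labs):
        super().__init__()
        self.emb = nn.Embedding(n_nodes, hidden)   # learned node features
        self.w1 = nn.Linear(hidden, hidden)
        self.w2 = nn.Linear(hidden, hidden)
        self.med_head = nn.Linear(hidden, n_meds)  # medication recommendation
        self.lab_head = nn.Linear(hidden, n_labs)  # lab value estimation

    def forward(self, a_norm):
        h = self.emb.weight
        h = F.relu(a_norm @ self.w1(h))            # first propagation step
        h = F.relu(a_norm @ self.w2(h))            # second propagation step
        return self.med_head(h), self.lab_head(h)

# a_norm would normally be the normalized adjacency D^-1/2 (A + I) D^-1/2;
# an identity matrix stands in here just to keep the sketch runnable.
n = 10
a_norm = torch.eye(n)
model = TinyMedGCN(n_nodes=n, hidden=16, n_meds=5, n_labs=4)
med_logits, lab_preds = model(a_norm)
loss = F.binary_cross_entropy_with_logits(med_logits, torch.rand(n, 5)) \
       + F.mse_loss(lab_preds, torch.rand(n, 4))   # joint multi-task loss
loss.backward()
```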
Text classification is an important and classical problem in natural language processing. There have been a number of studies that applied convolutional neural networks (convolution on a regular grid, e.g., a sequence) to classification. However, only a limited number of studies have explored the more flexible graph convolutional neural networks (convolution on a non-grid, e.g., an arbitrary graph) for the task. In this work, we propose to use graph convolutional networks for text classification. We build a single text graph for a corpus based on word co-occurrence and document-word relations, then learn a Text Graph Convolutional Network (Text GCN) for the corpus. Our Text GCN is initialized with one-hot representations for words and documents; it then jointly learns the embeddings for both words and documents, as supervised by the known class labels for documents. Our experimental results on multiple benchmark datasets demonstrate that a vanilla Text GCN without any external word embeddings or knowledge outperforms state-of-the-art methods for text classification. On the other hand, Text GCN also learns predictive word and document embeddings. In addition, experimental results show that the improvement of Text GCN over state-of-the-art comparison methods becomes more prominent as we lower the percentage of training data, suggesting the robustness of Text GCN to limited training data in text classification.
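To illustrate the graph construction step, the sketch below builds the two edge types Text GCN relies on: document-word edges weighted by TF-IDF and word-word edges weighted by positive pointwise mutual information (PMI) from sliding-window co-occurrence. The toy corpus and window size are assumptions for the example.

```python
import numpy as np
from math import log
from collections import Counter
from sklearn.feature_extraction.text import TfidfVectorizer

docs = ["graph convolution for text", "text classification with graphs"]
window = 3

# Document-word edges: TF-IDF weights.
tfidf = TfidfVectorizer()
dw = tfidf.fit_transform(docs).toarray()
vocab = tfidf.get_feature_names_out()

# Word-word edges: PMI over sliding-window co-occurrence counts.
win_count, pair_count, n_windows = Counter(), Counter(), 0
for doc in docs:
    toks = doc.split()
    for i in range(max(1, len(toks) - window + 1)):
        w = set(toks[i:i + window])
        n_windows += 1
        win_count.update(w)
        pair_count.update({(a, b) for a in w for b in w if a < b})

def pmi(a, b):
    p_ab = pair_count[tuple(sorted((a, b)))] / n_windows
    if p_ab == 0:
        return 0.0
    return log(p_ab / ((win_count[a] / n_windows) * (win_count[b] / n_windows)))

# One symmetric adjacency over [docs + words]; keep positive PMI only.
n_doc, n_word = len(docs), len(vocab)
A = np.eye(n_doc + n_word)
A[:n_doc, n_doc:] = dw
A[n_doc:, :n_doc] = dw.T
for i, wi in enumerate(vocab):
    for j, wj in enumerate(vocab):
        if i < j:
            A[n_doc + i, n_doc + j] = A[n_doc + j, n_doc + i] = max(pmi(wi, wj), 0)
```

A two-layer GCN with one-hot input features over this adjacency, trained on the labeled document rows, completes the method.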
As one of the most important psychological stress reactions, micro-expressions (MEs) are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support for lie detection, psychological analysis, and other areas. However, the lack of abundant ME data seriously restricts the development of cutting-edge data-driven MER models. Despite recent efforts to build spontaneous ME datasets to alleviate this problem, the amount of available data remains tiny. To solve the problem of ME data hunger, we construct a dynamic spontaneous ME dataset with the largest current ME data scale, called DFME (Dynamic Facial Micro-expressions), which includes 7,526 well-labeled ME videos induced from 671 participants and annotated by more than 20 annotators over three years. Afterwards, we adopt four classical spatiotemporal feature learning models on DFME to perform MER experiments, objectively verifying the validity of the DFME dataset. In addition, we explore different solutions to the class imbalance and key-frame sequence sampling problems in dynamic MER on DFME, so as to provide a valuable reference for future research. The comprehensive experimental results show that our DFME dataset can facilitate research on automatic MER and provide a new benchmark for MER. DFME will be published via https://mea-lab-421.github.io.
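On the class-imbalance problem, one standard remedy that experiments in this setting can compare against is inverse-frequency class weighting in the loss. The PyTorch sketch below shows that baseline with made-up per-class counts; it is not necessarily the solution the authors adopt.

```python
import torch
import torch.nn as nn

# Hypothetical per-class sample counts for a 7-class ME label set.
counts = torch.tensor([2500., 1800., 1200., 700., 500., 400., 426.])

# Inverse-frequency weights, normalized so the average weight is 1.
weights = counts.sum() / (len(counts) * counts)
criterion = nn.CrossEntropyLoss(weight=weights)

logits = torch.randn(8, 7, requires_grad=True)   # a batch of model outputs
targets = torch.randint(0, 7, (8,))
loss = criterion(logits, targets)                # rare classes count more
loss.backward()
```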
Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model achieved an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin doubled from 2015 to 2021, with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.
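The temporal attention idea behind STCA can be pictured as a learned weighting over the time steps of an image time series, so that discriminative parts of the growing season dominate the pooled feature. Below is a minimal sketch of such attention pooling; the feature sizes and the single-linear-layer scorer are assumptions, not the published architecture.

```python
import torch
import torch.nn as nn

class TemporalAttentionPool(nn.Module):
    """Scores each time step, softmaxes over time, and pools the features,
    letting the classifier focus on discriminative parts of the season."""
    def __init__(self, d_feat):
        super().__init__()
        self.score = nn.Linear(d_feat, 1)

    def forward(self, x):               # x: (batch, time, d_feat)
        alpha = torch.softmax(self.score(x), dim=1)   # (batch, time, 1)
        return (alpha * x).sum(dim=1)                 # (batch, d_feat)

x = torch.randn(4, 24, 32)              # 24 acquisitions per growing season
pooled = TemporalAttentionPool(32)(x)   # (4, 32), ready for a class head
```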
Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL. This work focuses on the former. Previous methods build the network with several modules like CNN, LSTM and Attention. Recent methods combine the Transformer with these modules for better performance. However, it requires tedious optimization skills to train a network composed of mixed modules, making these methods inconvenient to use in practice. In this paper, we propose to design \emph{pure Transformer-based networks} for deep RL, aiming at providing off-the-shelf backbones for both the online and offline settings. Specifically, the Transformer in Transformer (TIT) backbone is proposed, which cascades two Transformers in a very natural way: the inner one is used to process a single observation, while the outer one is responsible for processing the observation history; combining both is expected to extract spatial-temporal representations for good decision-making. Experiments show that TIT consistently achieves satisfactory performance across different settings.
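One minimal reading of the TIT cascade is sketched below in PyTorch: an inner TransformerEncoder processes the tokens of a single observation, and its pooled output feeds an outer TransformerEncoder over the observation history. Layer counts, dimensions, and mean pooling are placeholders rather than the paper's configuration.

```python
import torch
import torch.nn as nn

class TinyTIT(nn.Module):
    def __init__(self, d_model=64):
        super().__init__()
        layer = lambda: nn.TransformerEncoderLayer(
            d_model, nhead=4, batch_first=True)
        self.inner = nn.TransformerEncoder(layer(), num_layers=2)  # one observation
        self.outer = nn.TransformerEncoder(layer(), num_layers=2)  # the history

    def forward(self, obs_seq):
        # obs_seq: (batch, history_len, n_tokens, d_model)
        b, t, n, d = obs_seq.shape
        tokens = self.inner(obs_seq.reshape(b * t, n, d))  # spatial encoding
        summaries = tokens.mean(dim=1).reshape(b, t, d)    # one vector per obs
        return self.outer(summaries)[:, -1]                # temporal encoding

h = TinyTIT()(torch.randn(2, 8, 16, 64))   # (2, 64) decision representation
```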
Abstractive dialogue summarization has long been viewed as an important standalone task in natural language processing, but no previous work has explored whether abstractive dialogue summarization can also be used as a means to boost an NLP system's performance on other important dialogue comprehension tasks. In this paper, we propose a novel type of dialogue summarization task - STRUctured DiaLoguE Summarization (STRUDEL) - that can help pre-trained language models to better understand dialogues and improve their performance on important dialogue comprehension tasks. We further collect human annotations of STRUDEL summaries over 400 dialogues and introduce a new STRUDEL dialogue comprehension modeling framework that integrates STRUDEL into a graph-neural-network-based dialogue reasoning module over transformer encoder language models to improve their dialogue comprehension abilities. In our empirical experiments on two important downstream dialogue comprehension tasks - dialogue question answering and dialogue response prediction - we show that our STRUDEL dialogue comprehension model can significantly improve the dialogue comprehension performance of transformer encoder language models.
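The integration could be sketched, very loosely, as follows: STRUDEL summary entries become graph nodes, one message-passing step mixes them, and the pooled graph vector is fused with the transformer's dialogue encoding before a task head. Every dimension, the single-step propagation, and the fusion-by-concatenation choice here are guesses for illustration, not the paper's design.

```python
import torch
import torch.nn as nn

class StrudelFusion(nn.Module):
    """Fuses a dialogue encoding with graph-reasoned summary-entry nodes."""
    def __init__(self, d=256):
        super().__init__()
        self.msg = nn.Linear(d, d)          # one message-passing step
        self.head = nn.Linear(2 * d, 2)     # e.g., response-prediction logits

    def forward(self, dialogue_vec, node_feats, adj):
        # node_feats: (n_nodes, d) encodings of STRUDEL summary entries
        nodes = torch.relu(adj @ self.msg(node_feats))  # neighbor aggregation
        graph_vec = nodes.mean(dim=0)                   # pooled graph summary
        return self.head(torch.cat([dialogue_vec, graph_vec], dim=-1))

d = 256
out = StrudelFusion(d)(torch.randn(d), torch.randn(5, d), torch.eye(5))
```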
Pre-trained language models have achieved promising success in code retrieval tasks, where a natural language documentation query is given to find the most relevant existing code snippet. However, existing models focus only on optimizing documentation-code pairs by embedding them into a latent space, without the association of external knowledge. In this paper, we propose a generation-augmented query expansion framework. Inspired by the human retrieval process of sketching an answer before searching, we utilize the powerful code generation model to benefit the code retrieval task. Specifically, we demonstrate that rather than merely retrieving the target code snippet according to the documentation query, it is helpful to augment the documentation query with its generation counterpart: code snippets generated by the code generation model. To the best of our knowledge, this is the first attempt to leverage a code generation model to enhance the code retrieval task. We achieve new state-of-the-art results on the CodeSearchNet benchmark and surpass the baselines significantly.
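The retrieval-side mechanics are simple to sketch: embed both the plain query and the query concatenated with generated code, then rank snippets by similarity. The example below uses TF-IDF cosine similarity as a stand-in for the learned embedding model, and `generate_code` is a hypothetical stub for the code generation model.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def generate_code(query: str) -> str:
    """Hypothetical stub standing in for a code generation model."""
    return "def parse_json(path):\n    import json\n    return json.load(open(path))"

corpus = [
    "def read_json(file):\n    import json\n    return json.load(open(file))",
    "def add(a, b):\n    return a + b",
]
query = "load a json file from disk"

# Expand the documentation query with its generated-code counterpart.
expanded = query + "\n" + generate_code(query)

vec = TfidfVectorizer()
M = vec.fit_transform(corpus + [query, expanded])
plain, augmented = M[-2], M[-1]
print("plain   :", cosine_similarity(plain, M[:2]).ravel())
print("expanded:", cosine_similarity(augmented, M[:2]).ravel())
```

The expanded query shares code tokens with the target snippet, which is why its similarity to the relevant entry rises relative to the plain documentation query.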
We introduce \textsc{PoliteRewrite}, a dataset for polite language rewriting, a novel sentence rewrite task. Compared with previous text style transfer tasks, which can mostly be addressed by slight token- or phrase-level edits, polite language rewriting requires deep understanding and extensive sentence-level edits over an offensive and impolite sentence to deliver the same message euphemistically and politely. This makes the task more challenging, not only for NLP models but also for human annotators. To reduce the human effort needed for efficient annotation, we first propose a novel annotation paradigm based on a collaboration between human annotators and GPT-3.5 to annotate \textsc{PoliteRewrite}. The released dataset has 10K polite sentence rewrites annotated collaboratively by GPT-3.5 and humans, which can be used as a gold standard for training, validation and test, and 100K high-quality polite sentence rewrites by GPT-3.5 without human review. We hope this work (the dataset, 10K+100K, will be released soon) can contribute to research on more challenging sentence rewriting, and provoke further thought on resource annotation paradigms aided by large-scale pretrained models.
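The collaborative annotation paradigm can be organized roughly as below, where `model_rewrite` and `human_review` are hypothetical stubs for the GPT-3.5 call and the annotator interface; this is a process sketch, not the authors' tooling, and the mapping to the 10K/100K splits is only indicative.

```python
from typing import Optional

def model_rewrite(sentence: str) -> str:
    """Hypothetical stub for the GPT-3.5 polite-rewrite request."""
    return "Could you please take another look at this when you have a moment?"

def human_review(source: str, draft: str) -> Optional[str]:
    """Hypothetical stub: annotator accepts, edits, or rejects the draft."""
    return draft  # accept unchanged in this toy example

gold, silver = [], []
for src in ["Fix this garbage now!"]:
    draft = model_rewrite(src)
    silver.append((src, draft))          # model-only rewrites (the 100K split)
    reviewed = human_review(src, draft)
    if reviewed is not None:
        gold.append((src, reviewed))     # human-verified rewrites (the 10K split)
```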